ON THE CONGRUENCE ax + by=l MODULO xy 



J. BRZEZINSKI, W. HOLSZTYNSKI, AND P. KURLBERG 

Abstract. We give bounds on the number of solutions to the Diophantine equation {X + l/x){Y + l/y) = n 
as n tends to infinity. These bounds are related to the number of solutions to congruences of the form 
ax + by = 1 modulo xy. 



1. INTRODUCTION 

Erik Ljungstrand has asked the first author about estimates of the number of solutions to the equation 

(1) 

where n, X,x,Y,y are positive integers satisfying n > 1, x > 1 and y > 1. His computations suggested 
that the number of such solutions, when symmetric solutions obtained by transposing (X, x) and {Y, y) are 
identified, is always less than n. 

It is easy to see that y divides xX + 1 and x divides yY + 1. Denoting the corresponding quotients by b 



and a, we get the following system: 



where ah = n. Thus 



ax = yY + 1, 
by = xX + 1, 



ax = 1 (mod y) and by = 1 (mod x) . (2) 

It is clear that the integers x, y satisfying these congruences are relatively prime, and the system is 
equivalent to 

ax + by = I (mod xy). (3) 

It is also clear from the equations above that x ^ y, so when counting the solutions, we may assume 
X < y. It is not difficult to see that the problem of finding all solutions to equation (1) with 1 < x < y is 
equivalent to the problem of finding all solutions to the systems of linear congruences (2) for all a, b such 
that ab — n with x, y satisfying the same conditions (see Section 2). 

One of the aims of the present paper is to prove E. Ljungstrand's observation concerning the number f{n) 
of solutions to equation (1). The proof is a combination of an estimate of f{n) (see Theorem 3) proving 
the result for relatively big values of n and a portion of numerical computations, which together prove the 
inequality f{n) < n for all n. The systems of linear congruences (2) or the congruence (3) (for fixed a, b) seem 
to be interesting on their own rights. In the paper, we study the sets of solutions to these congruences and 
give some estimates for their size both from above and below. We give also a reasonably effective algorithm 
for finding all solutions of (1) in positive integers and attach some numerical results. In the last part of the 
paper, we study the arithmetic mean of the function f{n) and give some lower and upper bounds for its size. 

2. CONGRUENCES 

Our objective is to estimate the number of solutions with x,y > 1 to the congruence ax + by = 1 (mod xy) 
when a5 = n is fixed. 



1991 Mathematics Subject Classification. Primary 11D45, Secondary 11A25, 11D72. 
Key words and phrases. Diophantine equation, linear congruence, divisor function. 



1 



2 



J. BRZEZINSKI, W. HOLSZTYNSKI, AND P. KURLBERG 



Theorem 1. Let a, b be fixed positive integers and ab = n > 1. Let p{a, h) denote the number of pairs {x, y) 
of integers x,y such that xy \ ax + by — 1, 1 < x < y. Then for every n > 1 and for every real number 
1 < a < ^/n, 

1 ^, , , ^ / 0.6\ _ (2n- l)a 



p{a, b) < -\/nlog(n) + 21 



a \ a J 'i\pn — a 

Before we prove the Theorem, we need two preparatory results. Let 9{n) denote the number of divisors 
to n. 

Lemma 1. Let n>22 be a natural number and 1 < a < ^/n a real number. Then 

^ 9{n-k) < -Vnlog(n) + 2 [l + — j y^. 



k=l 



a I 



Proof. We have (see e.g. [2], p. 347): 



^\/" \/" 

E^(-^)=E E 1^2^ E i<2E(^ 

/s=l fc=l d\n—h fc=l d\n—k ii=l 

l<d<yn 



d 



< -s/n(\ogy/n + 0.Q) + 2y/n= -^/nlogn + 2(1 + —)^, 
a a a 

where the last inequality follows noting that ^) — logn is decreasing and less than 0.6 when n > 22. 

□ 

Lemma 2. Let a, b, x, y be positive integers such that ab = n, ax = 1 (mod y), by = 1 (mod x) and x,y > 1. 

Let ax — 1 = yY, by — 1 = xX and ax + by — 1 = kxy. Then 

(a) k = n~XY, 

(6) x = ^ andy=^, 

(c) max(a;,y) < fffi, 

{^k<^. 

Proof. We have 



xyXY = {ax — l){by — 1) = abxy — ax — by + l = abxy — kxy. 
Dividing by xy, we get (a). Now ax — yY = by — xX gives x{a + X) = y{b + Y), so = ^i^. But 

fax — 1 \ 

kxy = ax + by — 1 = ( \-bjy={Y + b)y 

shows that both fractions are equal to k, which proves (b). We have 

ab—k , ab — k , ab — k (2ab — l)k 

where the last inequality follows from b + Y = kx > 2k, and the first is equivalent to 

, ab—k ab—k ab—k 6—1 — 1 

ab — a = aib — 1) > = • = X , 

^ ' - Y b + Y ~1 Y b + Y -I kx-V 

that is, a{kx — 1)>X, when b ^ 1. This is equivalent to akx > a + X = ky, which immediately follows from 
ax = yY + I > y. By symmetry, we get the corresponding inequality with y replaced by x, which proves (c). 
Since x,y>2 and, of course, x ^ y,we have max{x,y) > 3. Thus (c) implies (d). 

□ 

Proof of Theorem 1. Let 1 < a; < y be integers such that xy \ ax + by — 1. Notice that given y there 
is only one x satisfying the necessary condition ax = 1 (mod y) and therefore at most one pair {x, y) such 
that xy\ax + by— 1. 

Using notations from Lemma 2, we have XY — ab— k = n— k<n. Observe that X and Y are positive, 
since x > 1 and y > 1. We consider contributions to the numbers of solutions in two cases. 
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First of all, let k > \\pn^ where 1 < a < \fn. Then according to Lemma 2 (c), we get 

2n-l 2n-l (2n - l)a 

w < < = — 

^ - 2A;- 1 - |v^- 1 2^-01 

Since every y gives at most one x, we have less than possibilities for {x, y) in this case. 

Assume now that k < -^^n is fixed. Then, since X \ n — k, we get at most 9{n — k) possibilities for its 
choice. But k and X uniquely define y, and consequently, x. Therefore the number of possibilities for {x, y) 

in this case is at most J2k=i ~ ^)) which according to Lemma 1 is less than: 

-\/nlog(n) + 2(1 + — ) ^/n. 
a \ a / 

Thus the total number of possible {x, y) is at most: 

1 ^, , , „ / 0.6\ r- (2n- l)a 
-^\og{n) +21 + — V^+ \ ^ ■ 
a \ a J 2-^n — a 

□ 

Notice that if we fix < ^^Jn and choose X as a divisor to n — fc, then x and y are uniquely determined 
regardless of whether x < y ov x > y. In fact, k and X uniquely determine y, Y (from XY = n — k) and, 

consequently, x from Lemma 2 (b). Thus if wc arc interested in the total number of solutions to (3) without 
the assumption x < y, then we have to count twice the number of solutions corresponding to k > ^\/n (they 
may correspond to a; < y or a; > y) plus the number of solutions corresponding to fc < ^\/n. Thus we have 

Theorem 1'. Let a, h he fixed positive integers and ab = n. Let p' {a., h) denote the numher of pairs [x, y) of 
integers x, y such that xy \ ax + by — 1, x,y > 1. Then for every integer n > I and every real 1 < a < ^/n, 

// ,x 1 ^, / X „ A 0.6\ r- 2(2n-l)a 
p'(a,6) < -^/nlog(n) + 2 1 ' \ /z: , ^ > 



a \ a J 2y/n — a 

□ 

For completeness of our discussion of the congruence (3), we note: 

Proposition 1. The congruence ax + by = 1 (mod xy) has infinitely many solutions in positive integers 

X, y if and only if a = I or b ~ 1. 

Proof. As we already know, there is only finitely many solutions with x,y > 1. Therefore, if we have 
infinitely many solutions, then in infinitely many of them a; = 1 or y = 1. If for example, a; = 1 then 
infinitely many y divide a — 1, so a = 1. The converse is trivial. □ 

3. THE EQUATION 

In this section, wc discuss the number of solutions to equation (1), give an estimate of it and prove that 

for big values of n, it is always less than n. First wc note: 

Theorem 2. (a) The solutions {X,x,Y,y) to the equation (1) with 1 < x < y are in a one-to-one corre- 
spondence with the quadruples {x, y, a, b) such that ab = n, 1 < x < y and ax + by=l (mod xy). 

(6) The solutions {X,x,Y,y) to the equation (1) with fixed value k ^ n — XY > 0, x,y > 1, X < Y , 
X < y if X = Y, are in a one-to-one correspondence with the set of the quadruples {X, Y, a, b) satisfying 



n^ab>n-k = XY, k\ gcd(o + X,b + Y), (4) 
where a + X > k,b + Y > k, X < Y and a < b if X = Y. Moreover, for every solution {X, x, Y, y) to the 

equation {!), x = and y = 

Proof, (a) As noted in the introduction, a solution {X,x,Y,y) to equation (1) with 1 < a: < y gives the 
congruence ax-\-by= 1 (mod xy), where a = and b = «^ = n. Conversely, if {x, y) is a solution 

to ax + 6y = 1 (mod xy), where ab = n and 1 < a; < y, then we easily check that {X, x, Y, y) with X = 
and Y — is a solution to equation (1). 

(b) Let {X, X, Y, y) be a solution to equation (1) with k = n — XY, x,y > 1, X <Y, and x < y ii X = Y. 
Then with a, b as above, we get a quadruple {X, Y, a, b). According to Lemma 2, n = ab>n — k = XY, 
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k I gcd(a + X,b + Y), and x — y = Hence x,y > 1 imply a + X > k and b + Y > k. Moreover, if 

X = Y, then x < y gives a < b. 

Conversely, if (X, Y, a, b) is any quadruple satisfying the conditions in (b), then we get {X, x, Y, y), where 
X = and y = which is easily seen to be a solution of the equation (1) satisfying all the conditions 

in (b). □ 

Remark 1. Notice that the condition k \ gcd(a + X,b + Y) is equivalent to gcd(a + X,b + Y) = k, since 

k = ab-XY = {a + X)b- X{b + Y) 

implies that gcd(a + X,b + Y) \ k. Moreover, if gcd(n, fc) = 1, then the conditions k \ a + X and k \ b + Y 
are equivalent. In fact, gcd(n, A;) = 1 implies gcd{X, k) = gcd(6, k) = 1, so the identity above implies the 
equivalence of both conditions. Thus if gcd(n, fc) = 1, then in order to find a solution to equation (1), it is 
sufficient to find factors a of n and X of n — fc such that k\ a + X with a + X > k and - + > fc. Then 

1^ '"^ fc X fc J 

is a solution. In particular, if a = 1, we obtain solutions for every fc, X such that gcd(fc, n) = 1, 

X I n - fc, fc I X + 1 and X + 1 > fc. (5) 
On the other hand, if a = n, we get solutions for fc, X such that gcd(fc, n) = 1, 

X I n - fc, fc I n + X and 1 + ^^^^^ > 1. (6) 

X 

We shall use these observations frequently in Section 6. 

Theorem 2 (a) implies that in order to estimate the number of solutions to equation (1), we have to 
estimate the number /(n) = Yliab=n Pi°'^ ^) solutions with 1 < a; < y to all the congruences ax + by = 1 
(mod xy) when ab = n. It is well known that for every £ > there is a constant Cg only depending on s 
such that 6{n) < Cerf. Applying this fact and Theorem 1, we get a bound on /(n) depending on n, a and 
e. However, we can get a somewhat sharper estimate noting that we can only use one of the congruences 
ax + by = 1 (mod xy) and bx + ay = 1 (mod xy), but instead, taking all possible solutions with x,y > 1 
(that is, removing the assumption x < y). In fact, it is clear that {x,y) solves the first congruence if and 
only if (y,a;) solves the second one. In such a way, we can use the estimate from Theorem 1', but only for 
the pairs a, b with ab = n and a < b. The number of such pairs is ^9{n) + e„, where e„ = if n is not a 
square and e„ = |, when n is a square. This gives the following result: 

Theorem 3. Let f{n) denote the number of solutions to the equation (1) and let 

1 ^ f 0.6\ ^ 2{2n - 1)q 



g{n, a) = -VElog{n) + 2(1 + —) 

a \ a / 



2n — (x^fn 

Then for every £ > and any real 1 < a < \fn there is a constant Ce such that 

f{n) < lein)gin,a) < C,n' (^^\og{n) + f 1 + — ) V^+ ~ 
when n is not a square, and 

fin) < ^{0{n) + l)g{n,a) < {C,n' + 1) (J-V^\og{n) + (^1 + 
when n is a square. In particular, if n is sufficiently big then f{n) < n. 



(2n - l)a 
2\fn — a y ' 
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4. AN ALGORITHM 

We can now construct a reasonably efficient algorithm for computing the number of solutions (A, x, Y, y) 
to equation (1) following their description in Theorem 2 (b). 

First of all, write down the divisor list of n. For each divisor a of n and for all integers X such that 
1 < A < y/n, repeat the following: Compute all the divisors A; of a + A, for each fc, check whether Y = 
and X = where & = ^, arc integers or not, put y = x = in the former case. If A = F 

and X > y replace {x,y) by {y,x). Check whether x > 1, y > 1 and accept the quadruple {X,x,Y,y) as a 
solution if all these conditions are satisfied. 

Theorem 2 (b) easily implies that this algoritm gives all the solutions to equation (1) and every solution 
exactly once. 

We are now ready for the numerical computations proving that the number /(n) of solutions to equation 
(1) is always less than n. 

As we noted before, for each e > there is a constant Cg only depending on e such that 9{n) < C^-nf for 
all n > 1. For simplicity, let e = | and denote by C* the least constant corresponding to this value of e. It 
is easy to show that on the positive integers the quotient 



attains its maximum value for n = 21621600, which gives C* < Co = 8.44697. 
According to Theorem 3, if n is not a square, we want to decide when 

fin) < -e(n)g(n,a) < —C*ni log(n) + C*ni + ~ ^^" c*n^ < n. 

2 2a \ a J 2-^n — a 

Let 

h{n,a,C)=n^ - — Clog n - 1 + — C-^ ^C. 

2a \ a J 2n — ay/n 

Choose a = 2.95. Then it is easy to check that h{n,a,C*) > h{n,a,Co) > when n > 11621000. By 

the definition of C*, this shows that /(n) < n for all n > 11621000 and it remains to check this inequality 

for all n < 11621000. In order to carry out the numerical computation, we find all the numbers n for 

which ^9{ri)g{n,a) > n. This happens when 0{n) is "big", which occurs for n having many small prime 

factors. The computations give 6523 numbers in the interval [2 • 10^, 11621000]: 3030 in [2 • 10^, 10^], 3482 

in [10^5 • 10^] and 11 in [5 • 10^ 11621000]. The numbers in the last interval arc 5045040 (4559), 5266800 

(4051), 5405400 (5069), 5569200 (4494), 5654880 (4534), 5765760 (5286), 6126120 (5211), 6320160 (5407), 

6486480 (4333), 7207200 (6309), 8648640 (5330), where the number in the parenthesis is the corresponding 

value of f{n). 

If n is a square, then we repeat the same procedure as above taking into account the extra term on the 
right hand side in the second inequality in Theorem 3. The bound 11621000 works in this case as well, so 
we have to consider all squares less than this bound (3408 numbers). Short computations show that there 
are 118 such squares for which the expression in the second inequality in Theorem 3 is not less than n (the 
biggest one 1587600). For these 118 numbers, we check by computer calculations that /(n) < n. 

5. REDUCED SOLUTIONS 

The main aim of this section is a non-computational proof of the inequality /(n) < n for the case when 
n = p is a prime number. We also give some estimates of the number k = n — XY for the solutions A, x, Y, y 
to equation (1). 

Let A, X, y, y be a solution to equation (1), which in this section will be denoted by n = [A, x, Y, y]. Recall 
that a, b denote integers such that ax = yY+1 and by = xX + 1. Wo say that a solution A, x, Y, y is reduced 
if A < y and Y < x. The reduced solutions are characterized in the following way: 

Proposition 2. Let n = [A, x, Y, y] . Then X, x, Y, y is reduced if and only if XY = n — 1. 

Proof. If A < y and Y < x, then Lemma 2 gives kxy = ax + by — 1 = xX + Yy+1 < x{y — l) + y{x — l) + l = 
2xy + l — x — y < 2xy. Thus k = n — XY = 1. Conversely, if XY — n—1, then by Lemma 2, k — n — XY = 1. 
This implies X <y and Y < x, since otherwise, kxy = xX + yY + 1 > xy, that is, A; > 1. □ 
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Corollary 1. The number of reduced solutions to the equation (1) is ■^9{n)0{n — 1). 

Proof. If X, y, Y,y is a, reduced solution, then ab = n, XY = n—1 and k = 1 according to Lemma 2. Thus 
each pair of divisors to n and n—1 defines a solution and every solution gives such a pair of divisors. Of 
course, we have to divide by 2 the total number of such pairs in order to obtain each desymmetrized solution 

exactly once. □ 

Proposition 3. If p is a prime, then f{p) < p. 

Proof. According to Corollary 1, the number of reduced solutions to p = [X, x, Y, y] equals 0{p— 1). Assume 
that the solution X, x, Y, y is not reduced. Without loss of generality, we may assume that 

xX + 1 = py and yY + 1 = 2:. 

The second equation gives Y < yY + 1 = x. Since the solution is not reduced, we have X > y (the equality 
is of course impossible by the first equation). The second equation gives y\x — 1, so y < x. We also have 
x < p, since otherwise py = xX + 1 > pX gives a contradiction. Thus x belongs to the set {3, . . . ,p — 1} 
with p — 'S elements. Moreover, py =1 (mod x) and y < x, so the congruence allows at most one y giving a 
solution to the equation. If now p=l (mod x), then y must be equal to 1, which is impossible. Thus x>2 
can not assume values dividing p — 1. The number of such x is 9{p— 1) — 2. Thus x assumes at most 

(p - 3) - (^?(p - 1) - 2) = p - f?(p - 1) - 1 

different values which give non-reduced solutions. According to Corollary 2, the number of reduced solutions 

is 0{p — 1) so the total number of solutions is at most p — 1. □ 
Every solution X,x,Y,y to equation (1) has the corresponding value of fc = n — XY. By Proposition 
3, fc = 1 corresponds to the reduced solutions. For these solutions, X and Y must be the least positive 
solutions to the congruences xX = —1 (mod y) and yY = —1 (mod x) when x,y are fixed. All other 
positive solutions to these congruences, with x, y fixed, are given hy X + ry, Y + sx where r,s > 0. Thus 
starting from n = [X, x, Y, y] with a fixed pair x, y, we get 

N — [X + ry, x,Y + sx, y] , 

where A'^ = {rx + b){sy + a). The number n = a6 is the least number for which such a (reduced) solution with 
fixed X, y exists. We have N — {X + ry){Y + sx) = k + r + s. In particular, if r = 1, s = or r = 0, s = 1, 
we get quadruples for which the corresponding parameter k decreases by 1: 

[X,x,Y,y]^[X + y,x,Y,y], [X,x,Y,y] ^ [X,x,Y + x,y]. (7) 

We shall say that these two transformations are elementary. Thus we can describe the solutions for a given 
n in the following way: 

Proposition 4. Every solution to n — [X, x, Y, y\ with k = n — XY > 1 can be obtained from a reduced 
solution to m — [Ao,x, Yq,?/] for some m < n, by successive use of k — 1 elementary transformations (7). 

Proof. If we have a solution n = [X, x, Y, y] with k = n — XY and A; > 1, then the solution is not reduced, 
which means that X > y ov Y > x, since Lemma 2 implies immediately that the equalities are impossible. 
If X > y, then we get n — [yY + 1) — [X — y, x, Y, y], while Y > x gives n — (xX + 1) = [X, x,Y — x, y] 
both with the corresponding value of k' = [n — {xX + 1)] — X{Y — x) = k — 1. This "reduction process" 
eventually leads to a reduced solution for a natural m <n and the same x,y. Starting from such a reduced 
solution and reversing the process, we get the given solution n = [X, x, Y, y] after fc — 1 steps. □ 
By Lemma 2 (d), k < Observe, that for t > 1 and n = 3i — 1, we have n = [1, 2, 2t — 1, 3] and in 

this case, k = n- XY = 

6. SOME ESTIMATES 

We wish to give upper and lower bounds on the number of solutions f{n) to equation (1) when n is 
averaged over some interval. For simplicity, if g, h are positive functions, we write g{n) <C h{n) if there is a 
positive constant C such that g{n) < Ch{n) for all sufficiently big natural n. 
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Theorem 4. There exist positive constants Ci, C2 such that for T >2, 

Tlog^T 



In the proof wc need the foUowing result: 
Lemma 3. E„<t^W^("- 1) = ^(riog^T). 
Proof. If m < T, we have 

e{m) < 2 ^ 1 



and thus 



l\m 

i<Vt 

^e(n)e(n-l)<4^ 5^ 5^ 1 

n<T n<T l\n k\(n-l) 

1<\/T k<\/T 

4 J2 \{n<T:l\n,k\{n-l)}\=A ^ \{d <T/l : k\{dl - 

k,l<VT k,l<VT 

= 4 ^ \{d<T/l:d=l-'^ modfc}|<4 ^ + ^ 



k,l<VT k,l<VT 

= 0{T log^ T) + 0{T) = 0{T log^ T). 



□ 



Proof of Theorem 4. With the notations from the introduction, given x,y, let us choose Xq and Yq such 
that xXq = —I (mod y), < Xq < y, yYo = — 1 (mod x) and < Iq < a;. We want to count the number of 
integers X,Y > 1 such that X = Xq (mod y), Y = Yq (mod x) and 



X + - ] [Y + - ] =n<T. 



when x,y > 1 are fixed. Noting that 

XF<&±ll(^<4Xy 
xy 

we will obtain lower bounds by estimating from below the number of X, Y such that 4XF < T. 
The congruences X = Xq (mod y), Y = Yo (mod a;) are equivalent to X,Y being of the form 

X = Xo + ry, Y = Yo + sx 

for r, s non-negative integers. Thus it is enough to estimate 

#{r, s>0:{Xo + ry){Yo + sx) < T/4}, 

which, since Xq < y and Yq < x, we may bound from below by 

#{r, s > : (r + l)(s + l)xy < T/A). 

This, in turn, is greater than 

T T T 

#{r, s>0:rs< ——} ~ -- — log — — • 
loxy loxy loxy 

Summing over x,y < T^/^, we then find that there are 

x,i/<Ti/3 ^ " a;,j/<Ti/3 " 



>TlogT V — >Tlog3r 



^ xy 
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ways of finding a;, y, X, Y such that 



{Xx + l){Yy+l) 
xy 



< T. 



In other words, on average, there are at least Ci log^T solutions for some Ci > 0. 

In order to prove the existence of an upper bound, we note first that if r = s = 0, then the solution 
[Xq, Yo,x,y) is reduced. For n <T the number of reduced solutions is according to Corollary 1 and Lemma 
3, 



Y^\0{n)e{n-l)=O{T\og^T). 



n<T 



Assume now that r > 1 and s = 0. Then the number of solutions {X,Y,x,y) to equation (1) such that 
n < T is less than 



#{r, x,y>0:XY^{Xo+ ry)Yo < T} < #{r, x,y>0: ryYo < T}. 
Since r > 1, we have M := yYo < T and since yYo = — 1 (mod x), we have 

#{r,x,y>0:ryYo<T} < ^ #{r,x,y > : x \ M + 1, y \ M, rM <T} 

M<T 

< e{M)e{M + i)T/M, 

M<T 

which, by partial summation and Lemma 3, is 0(T log^ T). 

The case r = 0, s > follows in a similar way to the previous one. 

Finally, if r, s > and {X, Y, x, y) is a solution to (1) such that n <T, then since XY < T, we get 

#{r, s,x,y>0:XY={Xo + ry){Yo + sx) < T} 
< #{r,s,x,y> : rysx <T} = 0{T\og^T). 

In other words, on average, there are at most C2 log^ T solutions for some C2 > 0. □ 
What else can be said about the size of f{n)l For instance, how close is f(n) to its average? As the 
following figure shows, /(n) oscillates rather widely. 

550 
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10050 10100 10150 10200 10250 10300 10350 10400 10450 10500 



Figure 1. The numbers of solutions to (1) for 10000 <n< 10500 

Since there is ^9{n)9{n — 1) reduced solutions (see Corollary 1), it is clear that the order of magnitude of 
/(n) sometimes is larger than any power of logn. Moreover, there are other sources of large oscillations. 
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Let M{n, k) denote the number of solutions X, x, Y, y to equation (1) such that n — XY = k. Of course, 

f{n) = ^M{n,k). (8) 

k 

Taking into account the contribution to /(n) from the number of solutions with k = 1 and a similar 

contribution for = 2 (see below Lemma 4), one might expect that the most significant fluctuations of /(n) 
depend onM{n, k) for small values of k. However, this is not the case as shown by the following construction 
(we thank Andrew Granville for pointing this out to us): Fix an arbitrary k and let M > /c be a large integer. 
Choose n = fc + HPsi where Pi are all primes such that pi = —1 (mod k) and pi < M. Denote the number 
of such primes pi by 7r(M, k, —1). By the prime number theorem for arithmetic progressions (see [1], Chap. 
20 and 22): 

<.(M,fc,-l)< 



(j>{k) log Af - ' ' ' ' - (j){k) log M 
for suitable positive constants Cfe, Ck only depending on k. Now, half of the divisors to n — fc are congruent 
to —1 modulo fc, so taking into account (5), we get 

f{n) > 2-(A^.fe.-i)-i. 

Hence log/(n) > 7r(M, A;, -1) » ^(kyfogM - ^^^^^ U.Pi ^ M'"^^'''-^\ we get 

logn«2^1ogpi«: 
and similarly, logn » which implies logM <C log log n. Thus 



log/(n) > > T—jj > 



Hence 



(j){k) log M log M log log n 
clogn 



/(n) » exp 



log log n 



for some constant c > only depending on k. 

This shows that arbitrary k may give "big" contribution to f{n) for a suitable n. It is also possible to 
show that the contribution to f(n) may come from many different values of fc. If n + 1 has many different 
divisors, then according to (6), where we choose X ~ 1, each such divisor k gives a solution to the equation 
(1). Unfortunately, we are unable to prove that /(n) ^ oo when n — > oo. What we prove with "some effort" 
is 

Proposition 5. Ifn>9, then f{n) > 8. 

Let ^odd('^) denote the number of odd divisors of n. Then for fc = 2, we have the following result: 
Lemma 4. For n> 3, we have 



M{n,2) 



^0{n) 6{n — 2) — 1, ifn is odd, 
6odd{n) Oodd{n - 2) - 1, ifn is even. 



Proof. In fact, if n is odd, then according to Remark 1, we get all solutions to (1) taking any divisor a to 
n (fe = ^) and any divisor X to n — 2 {Y ~ ^^^) such that a + X > 2 and b + Y > 2. The number of pairs 
of such divisors giving different quadruples {X, x, Y, y) with x < y is ^0{n) 9{n — 2) and the only case when 
a + X = 2orb + Y = 2 corresponds to the choice ofa = X = lora = n, X = n — 2, which gives only one 
quadruple with x < y. This proves the first case. 

If n is even, let n = 2''m, where m is odd. One of the numbers n,n — 2 must be divisible by 4, so let us 
assume that r > 2 (the case with n — 2 divisible by 4 is considered in similar way with the roles of n, n — 2 
interchanged). Thus n — 2 — 2{2^^^m — 1), and ^^^^^ is odd. If n — 2 = XY, then exactly one of the factors 
X, Y is even and the other one is odd. Since a + X and b + Y are even, exactly one of the factors a, b of 
n = ab must be odd. Thus all the possibilities for the sums a + X and b + Y are given by all the choices of 
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the odd factors of n and n—2. Only one such choice gives a + X = 2orb + Y = 2. This proves the second 
case. □ 
Now we prove that 

if n>ll, then M{n,l) + M{n,2) >7. (9) 

First let n be odd. Then 

M{n,l)+M{n,2) = ^0{n){e{n - 1) + 0{n - 2)) - 1 

Since n — 1 > 4 is even, 6{n — 1) > 4. Assume that 9{n) = 2. Then n is a prime. If also 6{n — 2) = 2, then 
6 I n — 1. Since n — 1 > 6, we have 0{n — 1) > 6, so M{n, 1) + 2) > 7. Assume now that 0(n — 2) = 3, 

that is, n — 2 = p^, where p > 3 is a prime. Then 3 | + 2 = n, which is impossible. Thus 9{n — 2) > 4, 
which gives M.{n, 1) + A^(n, 2) > 7. Notice that if n is a prime, n — 1 twice a prime and n — 2 is a product of 
two different primes, then A^(n, 1) + A^(n, 2) = 7. By Schinzcl's conjecture (sec [3]), this situation happens 
for infinitely many n. If 9{n) > 2, then it is easy to check that M{n, 1) + M{n, 2) > 8. 
Assume now that n is even, so 

M{n, 1) +M{n, 2) = ]^0{n)e{n - 1) + ^?odd(n)0odd(n - 2) - 1. 

We have 0{n) > 3, since n > 4. Assume 0{n) = 4. Since n > 8, we have n = 2p, where p is an odd prime. 
If n-1 is a prime, then 3 | n-2 = 2(p-l), so 6'odd(n)6'odd(«-2) > 4 and A^(n, l)+7W(n,2) > 7. If 6'(n) = 5, 
then n = 16 and the claim follows by a direct computation. If 9{n) = 6, then n = 32 or n = p^q for two 
different primes p, q. If p = 2, then n — 2 = 2{2q— 1) has at least two odd factors, so M.{n, 1)+M.{n, 2) > 7. 
li q = 2 and p = 3, we check the claim directly, and when p > 3, then n — 2 = 2(p^ — 1) is divisible by 3, so 
^odd(?^)^odd('^ — 2) > 6. If finally, 6{n) > 7, then of course, the inequality holds. 

Now we prove that 

if n > 12, then M{n,3)>l. (10) 
Assume first that 3 \ n (so 3 | n — 3) and let n bo even. Then n = 2m and n — 3 = 2m — 3. If for a prime 
p=l (mod 3), p I n - 3, then p> 7, so X ^p,Y = x = ^ > 1 and y = '-^^ > 1 (see Remark 1) 
give a solution to equation (1). If for a prime p = 2 (mod 3), p | n — 3, then p > 5, so X = 1, Y = 

X = 2±il > 1 and y = > 1 give such a solution. If n is odd, then n — 3 is even and we repeat the same 
arguments looking instead at the prime factors p of n. 

Let now 3 | n. Let n be even. Then n = 3*2m, where 3 f to, and n — 3 = 3(3*~^2to — 1) = 3r. If r has a 
prime divisor p = 1 or 2 (mod 3), we proceed exactly as in the previous case above when 3 \ n. Otherwise, 
2to — 1 is a power of 3, so n — 3 = 3'*+^ and n = 3(3" + 1). In this case, n must have a prime factor p > 2 
congruent to 2 modulo 3 and we get a solution to equation (1) as before. 

If n is odd, then n — 3 is even and divisible by 3, so the considerations are similar with the role of n and 
n — 3 interchanged. 

Now the proof of Proposition 7 follows immediately from (8), (9), (10) and by direct inspection of the 
cases n = 9,10,11,12. Still more elaborate arguments show that f{n) > 12 if n > 20 (we thank Jerzy 
Browkin for sending us his proof of this result and, in particular, for the proof of Lemma 4). □ 
Remark 2. It is no longer true that Al(n,4) > 1 for all sufficiently large n. If all primes dividing both n 
and n — 4 are congruent to 1 modulo 4, then by Remark 1, there are no solutions to equation (1) with fc = 4. 
In fact, this happens for infinitely many n by the following argument, for which we thank Mariusz Skalba. 
Let m be a natural number such that to ^ 1 (mod 3) and put n = 2w? + 2to + 5. Then 

n = (to - 1)^ + (to + 2f and n - 4 = to^ + (m + 1)^ 
are only divisible by primes congruent to 1 modulo 4. 
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